
208 research outputs found

Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing

Author: Brants T.
Favre B.
Garofolo J.S.
Gaur Y.
Gray S.S.
Lei X.
Mishra T.
Rousseau A.
Wang Y.-Y.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 05/12/2017

The accuracy of Automated Speech Recognition (ASR) technology has improved, but it is still imperfect in many settings. Researchers who evaluate ASR performance often focus on improving the Word Error Rate (WER) metric, but WER has been found to have little correlation with human-subject performance on many applications. We propose a new captioning-focused evaluation metric that better predicts the impact of ASR recognition errors on the usability of automatically generated captions for people who are Deaf or Hard of Hearing (DHH). Through a user study with 30 DHH users, we compared our new metric with the traditional WER metric on a caption usability evaluation task. In a side-by-side comparison of pairs of ASR text output (with identical WER), the texts preferred by our new metric were preferred by DHH participants. Further, our metric had significantly higher correlation with DHH participants' subjective scores on the usability of a caption, as compared to the correlation between the WER metric and participant subjective scores. This new metric could be used to select ASR systems for captioning applications, and it may be a better metric for ASR researchers to consider when optimizing ASR systems. Comment: 10 pages, 8 figures, published in ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)

arXiv.org e-Print Archive

Crossref

Multilingual Word Sense Induction to Improve Web Search Result Clustering

Author: Automatic S. H.
Brants T.
Ferraresi A.
Purandare A.
Steinberger R.
Uzuner O.
Zhang M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2015

In [12] a novel approach to Web search result clustering based on Word Sense Induction, i.e. the automatic discovery of word senses from raw text, was presented; key to the proposed approach is the idea of, first, automatically inducing senses for the target query and, second, clustering the search results based on their semantic similarity to the word senses induced. In [1] we proposed an innovative Word Sense Induction method based on multilingual data; key to our approach was the idea that a multilingual context representation, where the context of the words is expanded by considering its translations in different languages, may improve the WSI results; the experiments showed a clear performance gain. In this paper we give some preliminary ideas to exploit our multilingual Word Sense Induction method to Web search result clustering.

Crossref

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Beyond Textual Issues: Understanding the Usage and Impact of GitHub Reactions

Author: Aniche Maurício
Bissyandé T. F.
Brants Wesley
Coelho Jailton
Dabbish Laura A.
Kalliamvakou Eirini
Sharma A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/09/2019

Recently, GitHub introduced a new social feature, named reactions, which are "pictorial characters" similar to emoji symbols widely used nowadays in text-based communications. Particularly, GitHub users can use a pre-defined set of such symbols to react to issues and pull requests. However, little is known about the real usage and impact of GitHub reactions. In this paper, we analyze the reactions provided by developers to more than 2.5 million issues and 9.7 million issue comments, in order to answer an extensive list of nine research questions about the usage and adoption of reactions. We show that reactions are being increasingly used by open source developers. Moreover, we also found that issues with reactions usually take more time to be handled and have longer discussions. Comment: 10 pages

arXiv.org e-Print Archive

Crossref

Second-Order Belief Hidden Markov Models

Author: A. Aregui
D. Dubois
E. Ramasso
F. Fayad
G. Zhou
H. Soubaras
J. Kupiec
L.M. Lee
L.R. Rabiner
M.E.Y. Boudaren
P. Lanchantin
P. Smets
R.R. Yager
S.M. Thede
T. Brants
W. Pieczynski
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Hidden Markov Models (HMMs) are learning methods for pattern recognition. The probabilistic HMMs have been one of the most used techniques based on the Bayesian model. First-order probabilistic HMMs were adapted to the theory of belief functions such that Bayesian probabilities were replaced with mass functions. In this paper, we present a second-order Hidden Markov Model using belief functions. Previous works in belief HMMs have been focused on the first-order HMMs. We extend them to the second-order model.

arXiv.org e-Print Archive

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

The Relationship Between Plasma Flow Doppler Velocities and Magnetic Field Parameters During the Emergence of Active Regions at the Solar Photospheric Level

Author: A. Khlystova
A. Lagg
A.A. Golovko
B.W. Lites
C. Zwaan
C.E. Parnell
C.S. Barth
D. Bonaccini
D. Chou
E.N. Frazier
E.N. Parker
G. Bachmann
H.B. Snodgrass
H.J. Hagenaar
I. Kawaguchi
J.I. Garcia de La Rosa
J.J. Brants
K. Otsuji
K. Shibata
K.-S. Cho
K.L. Harvey
L.H. Strous
M. Kubo
P.H. Scherrer
R.K. Ulrich
S. Toriumi
S.A. Schoolman
S.I. Gopasyuk
S.K. Solanki
S.L. Guglielmino
S.L. Keil
T. Bai
T.D. Tarbell
V. Archontis
V.M. Grigor’ev
Y. Liu
Z. Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/12/2012

A statistical study has been carried out of the relationship between plasma flow Doppler velocities and magnetic field parameters during the emergence of active regions at the solar photospheric level with data acquired by the Michelson Doppler Imager (MDI) onboard the Solar and Heliospheric Observatory (SOHO). We have investigated 224 emerging active regions with different spatial scales and positions on the solar disc. The following relationships for the first hours of the emergence of active regions have been analysed: i) of peak negative Doppler velocities with the position of the emerging active regions on the solar disc; ii) of peak plasma upflow and downflow Doppler velocities with the magnetic flux growth rate and magnetic field strength for the active regions emerging near the solar disc centre (the vertical component of plasma flows); iii) of peak positive and negative Doppler velocities with the magnetic flux growth rate and magnetic field strength for the active regions emerging near the limb (the horizontal component of plasma flows); iv) of the magnetic flux growth rate with the density of emerging magnetic flux; v) of the Doppler velocities and magnetic field parameters for the first hours of the appearance of active regions with the total unsigned magnetic flux at the maximum of their development. Comment: 14 pages, 8 figures. The results of article were presented at the ESPM-13 (12-16 September 2011, Rhodes, Greece, Abstract Book p. 102-103, P.4.13, http://astro.academyofathens.gr/espm13/documents/ESPM13_abstract_programme_book.pdf)

arXiv.org e-Print Archive

Crossref

Structural Invariance of Sunspot Umbrae Over the Solar Cycle: 1993-2004

Author: A.A. Norton
A.A. Pevtsov
C.E. Thornton
D.-Y. Chou
E.H. Avrett
F. Albregtsen
F. Socas-Navarro
G. Kopp
H. Alfvén
H.P. Jones
I. Rüedi
J.H.M.J. Bruls
J.J. Brants
K.J. Li
L. Biermann
M. J. Penn
M. Schüssler
M.J. Penn
M.J. Wesolowski
M.P. Rast
P.N. Brandt
S.K. Mathew
T. A. Schad
T. Leonard
T.J. Bogdan
V. Martínez Pillet
W. Deinzer
W. Livingston
Y.-J. Moon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/08/2010

Measurements of maximum magnetic flux, minimum intensity, and size are presented for 12 967 sunspot umbrae detected on the NASA/NSO spectromagnetograms between 1993 and 2004 to study umbral structure and strength during the solar cycle. The umbrae are selected using an automated thresholding technique. Measured umbral intensities are first corrected for a confirming observation of umbral limb-darkening. Log-normal fits to the observed size distribution confirm that the size spectrum shape does not vary with time. The intensity-magnetic flux relationship is found to be steady over the solar cycle. The dependence of umbral size on the magnetic flux and minimum intensity are also independent of cycle phase and give linear and quadratic relations, respectively. While the large sample size does show a low amplitude oscillation in the mean minimum intensity and maximum magnetic flux correlated with the solar cycle, this can be explained in terms of variations in the mean umbral size. These size variations, however, are small and do not substantiate a meaningful change in the size spectrum of the umbrae generated by the Sun. Thus, in contrast to previous reports, the observations suggest the equilibrium structure, as testified by the invariant size-magnetic field relationship, as well as the mean size (i.e. strength) of sunspot umbrae do not significantly depend on solar cycle phase. Comment: 17 pages, 6 figures. Published in Solar Physics

arXiv.org e-Print Archive

Crossref

Resolving the infinitude controversy

Author: A Kornai
A Nevins
András Kornai
D Everett
GK Pullum
J Watumull
M Tomalin
MD Hauser
N Chomsky
P Brown
P Erdös
R Jackendoff
R Lakoff
R Levy
RM Dixon
SR Anderson
T Brants
T Langendoen
T Łuczak
TF Jaeger
WN Francis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

A simple inductive argument shows natural languages to have infinitely many sentences, but workers in the field have uncovered clear evidence of a diverse group of ‘exceptional’ languages, from Proto-Uralic to Dyirbal and, most recently, Pirahã, that appear to lack recursive devices entirely. We argue that in an information-theoretic setting non-recursive natural languages appear neither exceptional nor functionally inferior to the recursive majority.

Crossref

SZTAKI Publication Repository

A prospective cohort study on the relation between meat consumption and the risk of colon cancer

Author: Brants H.A.M.
Dorant E.
Goldbohm R.A.
Hermus R.J.J.
Sturmans F.
van 't Veer P.
van den Brandt P.A.
Publication date: 01/01/1994

Maastricht University Research Portal

Higher dietary flavone, flavonol, and catechin intakes are associated with less of an increase in BMI over time in women: a longitudinal analysis from the Netherlands Cohort Study

Author: Ambergen T.
Arts I.C.
Brants H.A.
Dagnelie P.C.
Goldbohm R.A.
Hughes L.A.E.
van den Brandt P.A.
Weijenberg M.P.
Publication venue: 'American Society for Nutrition'
Publication date: 01/01/2008

BACKGROUND: Dietary flavonoids are suggested to have antiobesity effects. Prospective evidence of an association between flavonoids and body mass index (BMI) is lacking in general populations. OBJECTIVE: We assessed this association between 3 flavonoid subgroups and BMI over a 14-y period in 4280 men and women aged 55-69 y at baseline from the Netherlands Cohort Study. DESIGN: Dietary intake was estimated at baseline (1986) by a validated food-frequency questionnaire. BMI was ascertained through self-reported height (in 1986) and weight (in 1986, 1992, and 2000). Analyses were based on sex-specific quintiles for the total intake of 6 catechins and of 3 flavonols/flavones. Linear mixed effect modeling was used to assess longitudinal associations in 3 adjusted models: age only, lifestyle (age, energy intake, physical activity, smoking status, alcohol intake, type 2 diabetes, and coffee consumption), and lifestyle and diet (vegetables, fruit, fiber, grains, sugar, dessert, and dieting habits). RESULTS: After adjustment for age and confounders, the BMI (kg/m²) of women with the lowest intake of total flavonols/flavones and total catechins increased by 0.95 and 0.77, respectively, after 14 y. Women with the highest intake of total flavonols/flavones and total catechins experienced a significantly lower increase in BMI of 0.40 and 0.31, respectively (between group difference: P < 0.05). This difference remained after additional adjustment for dietary determinants and after stratification of median baseline BMI. In men, no significant differences in BMI change were observed over the quintiles of flavonoid intake after 14 y. CONCLUSION: Our results suggest that flavonoid intake may contribute to maintaining body weight in the general female population.

Maastricht University Research Portal

Positive words carry less information than negative words

Author: A Chmiel
AA Augustine
B Pang
B Rime
CJ Bryan
D Garcia
D Lazer
ES Knowles
F Schweitzer
GJ Norman
GK Zipf
H Kucera
IM Kloumann
J Bohannon
J Bollen
J Boucher
J Redondo
J Reilly
JA Russell
JP Robinson
K Kosmidis
M Taboada
M Thelwall
M Yik
MD Hauser
ML-H Võ
MM Bradley
N Sebastián
NE Miller
P Rozin
PM Bentler
PS Dodds
R Ferrer i Cancho
RH Baayen
S Havlin
SA Golder
ST Piantadosi
T Brants
TF Jaeger
TL Griffiths
TM Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

We show that the frequency of word use is not only determined by the word length (Zipf 1935) and the average information content (Piantadosi 2011), but also by its emotional content. We have analyzed three established lexica of affective word usage in English, German, and Spanish, to verify that these lexica have a neutral, unbiased, emotional content. Taking into account the frequency of word usage, we find that words with a positive emotional content are more frequently used. This lends support to the Pollyanna hypothesis (Boucher 1969) that there should be a positive bias in human expression. We also find that negative words contain more information than positive words, as the informativeness of a word increases uniformly with its valence decrease. Our findings support earlier conjectures about (i) the relation between word frequency and information content, and (ii) the impact of positive emotions on communication and social links. Comment: 16 pages, 3 figures, 3 tables

arXiv.org e-Print Archive

Repository for Publications and Research Data

Crossref

Springer - Publisher Connector